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Abstract 

We introduce a lazy approach to the explanation-based ap¬ 
proximation of probabilistic logic programs. It uses only the 
most significant part of the program when searching for ex¬ 
planations. The result is a fast and anytime approximate in¬ 
ference algorithm which returns hard lower and upper bounds 
on the exact probability. We experimentally show that this 
method outperforms state-of-the-art approximate inference. 


1 Introduction 


Probabilistic logic programming (PLP) languages extend 
logical languages with probabilities. Examples of such 
languages are PRISM (S ato 1995 ). ICL ( Poole 2008] ), 
LPADs (IVennekens, Verbaeten, and Bruynooghe 2004) and 
ProbLog (De Raedt, Kimmig, and Toivonen 


A typical inference task in these languages is calculating 
the probability of a query. It is often computed by transform¬ 
ing the program into a weighted propositional formula and 
subsequently calculating the weighted model count (WMC) 
(IChavira and Darwiche 20081 ) of the formula. When for¬ 
mulae become large, computing the WMC becomes pro¬ 
hibitively expensive and approximating methods are needed. 

Explanation-based inference methods 

( [Kimmig et al. 2008) approximate the probability by 
constructing a smaller propositional formula on which the 
WMC is calculated. Earlier work (IRenkens et al. 20f4l ) 
developed an explanation-based algorithm in the presence 
of negation. The downside of this approach is that it uses 
the entire program when constructing the formula. We 
introduce a lazy approach that avoids this, which enables 
it to get results, even when the program is very large. The 
result is a fast anytime approximate inference algorithm 
which returns hard lower and upper bounds on the exact 
probability. 

In the remainder of the paper we will give some back¬ 
ground on PLP and explanation-based approximation in 
Section 0 explain the intuition behind the lazy approach in 
Section[3] provide experimental results in Section[4]and con¬ 
clude in Section [5] 


2 Probabilistic Logic Programming 

Most PLP languages are based on Sato’s distribution seman¬ 
tics dSato 19951) . In this paper, we will use ProbLog as the 
example language. We will limit ourselves to ground pro¬ 
grams but the approach is easily extended to non-ground 
programs. 

A ProbLog program consists of two parts: a set of inde¬ 
pendent probabilistic facts T and a set of rules 1Z. Each 
probabilistic fact has the form p : : atom, stating that atom 
is true with probability p. Each rule is a logic programming 
rule of the form head : - body . where head is an atom 
and body is a conjunction of literals. Atoms that appear as 
the head of a rule are called defined atoms and we assume 
the set of defined atoms and probabilistic facts to be disjoint. 

A full interpretation of the probabilistic facts is called a 
possible world W. The probability of a possible world is 
defined as the product of the probabilities of the literals in 
the world. Together with the rules 1Z, each possible world 
assigns unique truth values to the defined atoms. When W U 
1Z |= q holds, we say that q is true in the possible world 
W. When calculating the probability of a query q, only the 
worlds in which the query is true are taken into account: 

^ > ( < z ) = Swut ?.|=<2 riiew ^*(0 

An explanation £ for a query is a partial interpretation of 
the probabilistic facts. It specifies a set of possible worlds 
in which the query is true. Since the probabilistic facts are 
independent, the probability of an explanation can be calcu¬ 
lated as: P{£) = n, e£ P{1) 

Renkens et al. (2014) use explanations to construct for¬ 
mulae for the upper as well as lower bound of a query q. 
When we write an explanation as a conjunction of literals, it 
is clearly a formula for the lower bound of the query. It cap¬ 
tures a set of possible worlds in which q is true, but not all 
of them. This lower bound can be improved by taking the 
disjunction of multiple explanations. Similarly, a disjunc¬ 
tion of explanation for -<q leads to an upper bound on the 
probability. Renkens et al. (2014) search optimal (highest 
probability) explanations for q and -> q, by transformation to 
a weighted partial MAX-SAT problem, until time runs out. 
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3 Lazy search 

The approximation algorithm used in this paper is identical 
to the one in (iRenkens et al. 20141 except for one key part. 
In contrast to Renkens et al. (2014), we only use part of the 
program (the lazy program) when searching explanations. 
This can lead to efficiency gains when the original program 
is big. We will show the intuition behind this approach on 
a toy example. It defines paths in a network, p (x, y) and 
e (x, y) state that there is a path and edge respectively, be¬ 
tween nodes x and y. We show the rules and facts for query 
p (1, 4), for which we will search explanations: 

Rules: {p(l,4) :-e(l,2),p(2,4);p(2,4) :-e(2,4); 

p(l,4):-e(l,3),p(3,4);p(3,4):-e(3,4)} 

Facts: (0.8::e(l,2);0.1::e(l,3);0.5::e(2,4); 

0.4::e(3,4)} 

When the search is started, we add all probabilistic facts to 
the lazy program. However, we add none of the rules to the 
program and instead add for each head of a rule, a weighted 
fact ( 1 ; 1 ) : : head. The fact head will have a weight equal 
to one both when it is true as well as false. This means that 
if head receives a truth value in an explanation, it always 
multiplies the probability of the explanation with one. We 
call head unexpanded. 

Rules: 0 

Facts: (0.8::e(l,2);0.1::e(l,3);0.5::e(2,4); 

0.4::e(3,4); (1;1) ::p(l,4); (1;1) ::p(2,4); 

(1;1) : :p(3,4) } 

Subsequently, the optimal explanation for p (1, 4) is 
searched in the lazy program. This is done in the same 
way as in Renkens et al. (2014) but is easier since the lazy 
program is smaller. The resulting explanation is {p (1,4)}. 
When the optimal explanation in the lazy program contains 
unexpanded heads, they are replaced by their rules and the 
search for the optimal explanation is repeated. The program 
for the next iteration is: 

Rules: {p (1,4) :-e (1,2) ,p (2, 4); 

p(l,4):-e(l,3),p(3,4)} 

Facts: {0.8::e(l,2);0.1::e(l,3);0.5::e(2,4); 

0.4::e(3,4); (1;1) ::p(2,4); (1;1) ::p(3,4)} 

Again the optimal explanation (e(l,2);p(2,4)} is 
searched and (1; 1) : : p (2,4) is replaced by its rules. 

Rules: {p (1,4) : -e (1,2) , p (2, 4); p (2,4) : -e (2,4); 

p(l,4):-e(1,3),p(3,4)} 

Facts: (0.8::e(l,2);0.1::e(l,3);0.5::e(2,4); 

0.4::e(3,4); (1; 1) ::p(3,4)} 

Now, the optimal explanation {e(l,2);e(2,4)} does 
not contain any unexpanded heads. In general, optimal ex¬ 
planations without unexpanded heads are also optimal ex¬ 
planations in the original program. Any explanation in the 
lazy program, containing unexpanded heads, needs addi¬ 
tional facts to make it an explanation in the original program. 
This can never increase its probability. 


4 Experiments 

We experimentally evaluate our approximate infer¬ 
ence algorithm by comparing to the non-lazy approach 
(iRenkens et al. 20141) and a state-of-the-art forward reason¬ 
ing approach, which we will call TP (iVlasselaer et al. 20151 1. 
We evaluate 500 queries on the biological network of 
( Ourfali et al. 2007 ) with a timeout of 15 minutes per query. 
The results can be found in Table [I] They show that the lazy 
approach outperforms both other approaches. 



non-lazy 

TP 

lazy 

Almost Exact 

0 

30 

89 

Tight Bound 

0 

207 

272 

Loose Bound 

0 

263 

139 

No Answer 

500 

0 

0 


Table 1: The number of queries for which the difference between 
upper and lower bound is < 0.01 (Almost Exact), in [0.01, 0.25) 
(Tight Bounds), in [0.25, 1) (Tight Bounds) and = 1 (No Answer) 


5 Conclusions 

We have proposed a lazy approach to explanation-based ap¬ 
proximation for probabilistic logic programs. This approach 
outperforms non-lazy explanation-based methods as well as 
other state-of-the-art approaches when programs are large. 
While this paper only discusses the ground case, all tech¬ 
niques can be extended to the non-ground case. 
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